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(57) ABSTRACT 

Apparatus, and a method for switching digital signals of 
different protocols in a large switching network. Signals and 
different protocols are encapsulated in a common protocol 
by a set of input modules. The signals in the common 
protocol are then switched by a large central stage to a 
plurality of output modules, which then generate signals in 
the correct protocol. Narrow-band signals are converted into 
broadband signals, so that the central stage switches only 
broadband signals. This simplifies the design of the central 
stage, while still making it possible to have a very large and 
economical central stage switch. Advantageously, this 
arrangement allows a single large switch to switch large 
quantities of traffic in a plurality of protocols. 
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LARGE COMBINED BROADBAND AND 
NARROWBAND SWITCH 

RELATED APPLICATION 

This Application is related to an Application entitled, 
"Flexible Telecommunications Switching Network" being 
filed by C. J, Chrin and M. J. Zola, concurrently, with this 
Application, and assigned to the same assignee. 

Technical Field 

This invention relates to digital telecommunications 
switching systems, and more specifically, to narrowband and 
broadband digital telecommunications switching systems. 
Problem 

As is well known, the demands for telecommunications 
services are changing rapidly. Many of these demands are 
for services such as video, which require a much broader 
band than the conventional audio band. The combination of 
cable television distribution systems with telephone and 
other customer premises equipment telecommunications 
systems, is making more and more available a loop plant 
capable of transmitting broadband signals to individual 
customers. At the same time, the development of switching 
systems which handle both broadband and narrowband 
signals in an efficient and integrated way, has not kept pace 
with this demand. Accordingly, a problem of the prior art is 
that there is no satisfactory large switching system for 
processing both narrowband and broadband traffic efficiently 
and an integrated fashion. 
Solution 

The above problem is solved, and an advance is made 
over the prior art in accordance with this invention, wherein 
a core central network receives inputs in a standard format 
from a plurality of connecting modules, and switches these 
signals among the modules. In order to handle a large 
amount of broadband traffic efficiently, these signals each 
carry a substantial number of base band channels combined 
into a single signal. In accordance with Applicant's preferred 
embodiment, the standard signal is a signal that carries 2.043 
Mb per second payload, the equivalent of 32 DSO voice 
channels. Each such signal is switched through the central 
core entity without changing the information in the payload. 

In accordance with one feature of Applicants' invention, 
when a narrowband signal, i.e., a single voice channel, is to 
be switched through the core, this narrowband signal occu- 
pies only a single DSO payload, with the rest of the signal 
being occupied by "fill" bits. Advantageously, this allows 
individual voice channels to be switched through the central 
core from any connected module to any other connected 
module. 

In accordance with Applicants' preferred embodiment, 
the modules connected to line and trunk signals convert 
these signals into standard format signals. These modules 
include a microprocessor based switch, wherein the micro- 
processor receives input pulse code modulated signals gen- 
erated by standard line and trunk interface cards, and con- 
verts these into standard intermediate stage signals, and 
switches signals from particular lines and trunks to a stan- 
dard signal destined for the appropriate module at the other 
end. In addition, output interface units for interfacing 
between facilities such as SONET/SDH, (Synchronous 
Optical Network/Synchronous Digital Hierarchy) facilities, 
and the cores which are connected to the core switch, with 
the conununi cations being the standard signals. These output 
interface units convert between the standard signal and an 
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ATM, or IP signal, for transmission over virtual channels of 
the SONET facilities. 

The standard signals in this embodiment are VT2 signals 
and carry 32 64 Kbit channels, and are further characterized 

5 in the discussion of FIGS. 17-20. 

Advantageously, using 1999 technology, a core network 
for supporting 12288 (768x16), VT2 signals, can be housed 
on a single custom silicon chip. This is a great advantage 
compared to disbursing this circuitry over many chips, since 

10 intra-chip connections do not require interface circuits, and 
can use the very narrow circuit interconnection lines allow- 
able on a silicon chip. Advantageously, by allowing one 
narrowband signal to occupy to a full VT2 signal on a full 
interconnectivity among modules connected to the core 

15 network, is possible without requiring the core network to 
switch anything other than VT2 signals. 

In accordance with another feature of Applicants* 
invention, connections to prior art modules is made possible. 
Prior art modules such as the communication module of a 

20 5ESS® switch, manufactured by Lucent Technologies Inc., 
communicate via proprietary link signals, which are encap- 
sulated at the input of the core network into VT2 signals. 
These NCT links can carry both narrowband and broadband 
signals. The communication module can be connected to a 

25 switch module of a 5ESS® switch. 

In accordance with another aspect of Applicants' 
invention, a group of modules are interconnected by a 
plurality of center stage modules as in a Clos network, a 
well-known arrangement for guaranteeing absence of block- 

30 ing from any input to any output. Advantageously, a large 
non-blocking network is made feasible by this arrangement. 

BRIEF DESCRIPTION OF THE DRAWINGS(S) 

FIG. 1 is a block diagram of a microprocessor based 
35 time-slot interchange module; 

FIG. 2 is a block diagram of the internal microprocessor 
architecture of the TSI module; 

FIG. 3 is a block diagram of memory and buffer layouts 
for the TSI module; 

40 

FIG. 4 is a flow diagram of the program for controlling the 
TSI module; 

FIG. 5 shows modifications of the program of FIG. 4, for 
the case in which the multiple sub-frames are encountered 
45 between each frame synchronizing pulse. 

FIG. 6 illustrates the program for controlling the 
microprocessor, acting as a time multiplexed switch; 

FIG. 7 shows the program for a TSI, wherein groups of 
time-slots are bundled and switched as a bundle; 
50 FIG. 8 illustrates a cell header for an asynchronous 
transport method, (ATM), cell header; 

FIG. 9 illustrates the basic operation of the processing of 
an ATM cell; 

FIG. 10 is a programmer's module, illustrating the layout 
of memory for a microprocessor acting as a ATM switch; 

FIG. 11 is a flow diagram, illustrating the operation of the 
input processing of a microprocessor ATM switch; 

FIG. 12 illustrates the processing of the output queue of 
60 the microprocessor program for an ATM switch; 

FIG. 13 illustrates the process of processing ATM cells to 
output links of a microprocessor ATM switch; 

FIG. 14 is a block diagram, illustrating the arrangement 
for expanding the size of a microprocessor control switch 
65 through replication of microprocessor complexes; 

FIG. 15 shows a CLOS network of switching micropro- 
cessors; 
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FIG. 16 is a conceptual diagram, illustrating a 3-stage 2. This data, is then stored in the internal microprocessor 

network for use as a combined circuit and packet switch; cache memory which may be expanded by a level 2 

FIG. 17 is a detailed block diagram of such a 3-stage «*e on or outside the microprocessor chip, and/or an 

network: auxiliary memory outside the microprocessor chip, and 

„ , 5 is manipulated under stored program control in such a 

FIG. 18 shows details of the connections to the central way ^ lQ provide the desired switchillg function. 

fabric of FIG. 17, 3 The mofe ^q^Ay sections of the stored pro- 

FIG. 19 shows arrangements for interconnecting a phi- gram are, advantageously, stored in caches, 

rality of the central stages of FIG. 17; and 4 ^ outputs m rcad out in para u c l to parallel 

FIG. 20 is a more detailed block diagram of the core fabric 10 m> serial out shift registers, and then clocked out onto 

of FIG. 17. serial bit streams. 

FIG. 1 is a block diagram of the basic system architecture. 

GENERAL DESCRIPTION ncart 0 f mc system is a Microprocessor containing on 

The descriptions of FIGS. 1-14 relate to single-stage board and ^ caches and external Input/Output 

networks for performing the TSI function, TMS function, 15 consisting of serial to parallel shift registers as Input .Buffers, 

and for performing within the same module, also functions ™A parallel to serial shift registers as Output Buffers. An 

of packet switching for switching ATM signals, and signals Input/Output decoder under the control of the Microproces- 

in the Internet Protocol, (IP). FIG. 14 shows how a group of sor sclects Io P ut Buffcr which P uts lts data on the Data 

smaller modules can be paralleled to perform the function of Bus or * c 0ut P ut Buffcr > which rcads the data frora the daU 

a larger module. FIG. 15 shows a Clos network made of the 20 bus - ^ C 0 ™™ 1 Register is used to receive and transmit 

modules described earlier. FIG. 16 is a conceptual diagram, contro1 messages from/to the outside world. The external 

illustrating a 3-stage network for use as a combined circuit Memory is used to store back-up and maintenance code, as 

and packet switch. The subject matter of FIGS. 17-20, weU as data structures which are too large to fit into the 

which is the bulk of the claimed subject matter of this cache. Microprocessor software is used to provide switching 

Application, uses the modules described in the earlier part of 25 functionality which has traditionally been provided by hard- 

the Application, but interconnects these modules with a ware implementation. Thus, multiple diverse functions can 

novel type of center stage, to create a very large switch. be provided concurrently by a single microprocessor archi- 
tecture. By changing the resident software, different sets of 

DETAILED DESCRIPTION functions can be provided. 

30 FIG. 1 shows the input and output data streams of a 

This specification describes an arrangement for, and time-slot interchange unit 100 in accordance with Appti- 
method of implementing multiple hardware functionality by cant > s invention. In one preferred embodiment of Appli- 
appropriate software on a Reduced Instruction Set cant ' s invention, a 300 MHZ PowerPC® EC 603e 
Computer, (RISC) microprocessor. Although this is microprocessor, manufactured by the Motorola Corporation, 
described in terms of a RISC microprocessor, other types of 35 can switch 192 serial input and output streams, each con- 
microprocessor implementations, (e.g., Complex Instruction sisting of 32 time-slots at a bit rate of 2.048 Mbits per 
Set Computer), (CISC) can also be used. Multiple functions second. The input comprises n serial input streams, stream 
can reside on the same microprocessor simultaneously, or De i ng connected to input buffer 101, . . . , and serial 
only a single function can be provided. The determination of mput stream, n-1, being connected to input buffer 102. The 
the type of treatment each input and/or output shall receive, 40 nrst mput str eam is collected in a shift register of input 
(e.g., Time-Slot Interchange (TSI), Time Multiplexed bu ff er jqi, and then transmitted in parallel, sequentially to 
Switch (TMS), Cross-connect (XCON), Asynchronous a four-stage, sixty-four bit per stage, buffer. The last stage of 
Transfer Mode (ATM) Switch, Internet Protocol (IP) Router, m is buffer is connected to a series of sixty -four tri-state bus 
Dynamic Synchronous Transfer Mode (DTM), Frame Relay drivers for driving parallel bus 105. Also connected to 
(FR) switch, etc.), is determined by, and can be reconfigured 45 parallel bus 105, are n output buffers 111, . . . , 112. These 
by, software control. Conversion between input to output outputs buffers also comprise four-stage, sixty-four bit 
formats can also be provided, e.g., a circuit switched PCM registers, the input stage of which is connected to sixty-four 
format can be converted to/from ATM format. A method is DUS receivers connected to bus 105, and the output stage of 
identified for using multiple microprocessors for building which is connected to a shift register for generating a serial 
very large configurations for applications which do not fit on 50 output stream. Also connected to bus 105, is microprocessor 
a single microprocessor. 120 which accepts inputs in bursts of 256 bits as four 

Other advantages include: associated 64-bit data bus reads, from each of the n input 

1. Little or no VLSI Required— faster time to market buffers 101, . . . , 102, under the control of the program 
(eliminates development). stored in the microprocessor. Similarly, the microprocessor 

2. Microprocessor Self Test-<reduces investment in chip 55 delivers bursts ° f 256 bits as four associated 64 bit data bus 
and board test tools} writes to each of the n output buffers after having generated 

„ _ „ , T rr, . ^ 1 the output burst through the reading of the inputs under the 

3. Follows Moore s Law Technology Curve Directly- of a cOQtrol ^ of ^ m of ^ micro _ 

(graceful evolution). processor. 

4. Core Architecture Usable by Multiple Applications. ^ ^ j /0 decoder ^lit 13 o, under the control of the 

5. Results in Lower Development Effort — (contributed to microprocessor, is used to gate the tri-state outputs of the 
by all of the above). input buffers onto the bus, and to gate the output of the bus 

Among the basic techniques used are the following: into the n output buffers 111, . . . , 112. The I/O decoder 

1. Input bit streams are clocked into serial in, parallel out receives inputs from the microprocessor address bus. 

shift registers and then read out in parallel onto the 65 Also connected to bus 105, is a memory 122 for storing 

microprocessor data bus under control of the micro- infrequently used data and program text such as data 

processor address bus. required for performing tests or diagnostics, non-cached TSI 
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code, and as a backup for data stored in the microprocessor accordance with Applicant's invention. The process starts 

cache, such as the microprocessor program text and the path with the microprocessor waiting for a frame synchronization 

memory. Also connected to bus 105, is control register 124, pulse (Block 401). When the frame synchronization pulse 

which interfaces with a call processing controller or other arrives, it signals the beginning of the synchronized loading 

switches of the telecommunications network, and receives 5 0 f tne input buffers 101, . . . , 102, from the serial input 

and transmits control messages. streams, and triggers several initialization steps. The 

FIG. 2 is a block diagram of those key parts of the memory write address, (TSI write pointer 315), is initialized, 

microprocessor which are pertinent to the understanding of (Action Block 402)? ^ ^ the ^necl location in the TSI 

Applicant s invention. The microprocessor contains a pro- buffef m ^ 303 ^ ^jy^ for thc Motion 

gram cache 201 for storing the control program which from ^ m buffcR m m ^ buffefi 

controls the operations of the time-slot interchange unit. The ~ A . 4 , , fK . ™ * *m\ %. . ,_ c _ 

output of the program cache goes to an instruction queue 203 of3ket 15 &5 lon J B1 °^ * 03 >' to choose either framc 

for storing a plurality of instructions in order to allowfor the mcmor y ? 01 or 3 ° 3 m the TSI buffer for storing the input 

rapid execution of simple loops that is made possible using data on alternate frames. The microprocessor \hcn waits ► for 

pipelining techniques. The instruction queue interacts with m P ut buffcr loaded ( Acb0D Block 4(M >' which 

an instruction Control Block 205, to deliver the appropriate 15 establishes that the buffers 101, . . . , 102, are full, and then 

instructions to arithmetic and logic unit (ALU) 207. The the input buffer address is initialized, (Action Block 405), to 

ALU executes its received instructions and operates to point to the first input buffer 101. In order to guarantee that 

perform the steps required by the instruction, by controlling Action Block 406 reads new data from the input buffer, and 

load store unit 213, which in turn accesses a data cache 211. not stale cached data from a previous cycle, Action Block 

ALU 207 also controls a group of internal registers 215, for 20 405 invalidates the cache data associated with the input 

short term storage, and for the control of the microprocessor. buffer address before initiating the read. The input buffer 

A bus interface 217 communicates between bus 105, (FIG. pointed to by the input buffer address is then read, (Action 

1), and within the microprocessor with the data cache 211, Block 406), in a burst as four connected 64 bit data bus 

and for changes or back-up in the software, also communi- operations, and stored in the microprocessor cache memory 

cates with program cache 201. 25 in either TSI buffer 301 or 303, depending on the double 

FIG. 3 shows pertinent memory data stored in data cache buffering offset. Test 407 determines whether all inputs for 

211 of microprocessor 120, and in hardware registers. The this frame have been written. If not, then the buffer address 

contents of the data cache contain, among other items, the is incremented, (Action Block 409), and the next buffer is 

data received from input buffers 101, . . . , 102, and the data read into the TSI buffer, (Action Block 406, previously 

to be delivered to output buffers 111, . . . , 112. Data received 30 described). This loop is continued until the results of test 407 

from the input buffers 101, . . . , 102, is stored in TSI buffer indicates that all inputs for this frame have been written. 

301 or 303. The data from the various input buffers is stored At this point, the TSI read cycle begins. The output buffer 

sequentially in one of these buffers in Applicant's preferred address 334 is initialized, (Action Block 421), the TSI output 

embodiment. In order to handle nx64 kilobit per second buffer address is initialized, (Action Block 423), and the 

connections, the TSI buffer contains buffer 301, and a 35 control map pointer 313 is initialized to point to the top of 

second buffer 303, for storing another frame of this serial the control map, (Action Block 425). The contents of the 

input data. Buffers 301 and 303 are used alternately. Control control map are read to an index register, (Action Block 

map 311 is used to control the reading of the contents of TSI 427), and the index register is used to read the eight bit 

buffers 301 or 303 in order to generate an output for storage time-slot from the TSI buffer, (Action Block 429); (frame 

in the TSI output buffer 321, for transmission to one of the 40 301 or 303 is accessed depending on the double buffering 

output buffers 111, . . . , 112. TSI write pointer 315 is used offset established in Action Block 403). The read byte is then 

to keep track of where the next input from one of the input written into the TSI output buffer in the cache, at the 

buffers 101, . . . , 102, is to be stored in TSI buffer 301 or appropriate offset, (TSI output buffers 321), (Action Block 

303, Control pointer 313 is used to point to the appropriate 431), determined by which of the 32 bytes is being written, 

portions of control map 313 in order to control accessing the 45 Test 433 is used to determine whether 32 bytes have been 

TSI buffer in order to obtain the time-slots that are required written; if not, Action Block 427 is re-entered, and the loop 

to fill the TSI output buffer 321. Input buffer count 331 is repeats Action Blocks 427, 429, 431. When 32 bytes have 

used to control the cycling for accepting inputs from the been written, as indicated by a positive result of test 433, 

appropriate one of the n input buffers 101, . . . , 102, selected then 32 bytes are written from the cache, (Action Block 

by input buffer address register 332, and output buffer count so 441), by a data cache block flush operation in a burst of four 

333 is used to control the distribution of an output collected connected 64 bit data bus writes into the output buffer 

in TSI output buffer 321 to one of the n output buffers 111, . . . , 112, specified by the output buffer address 334. 

Ill, . . . , 112, selected by output buffer address register 334. Test 443 determines whether all outputs have been written. 

Link status memory 341 is used to identify any of then input If not, then the TSI output buffer read address is 

links or any of the n output links that are out of service. This 55 re-initialized, (Action Block 445). The output buffer address, 

status can be checked prior to accepting an input from one (output buffer address 334), is then incremented, (Action 

of the n input buffers 101, . . . , 102, or prior to sending an Block 447), and the loop for writing into the output buffer 

output to one of the output buffers 111, ... , 112. is re -entered in Action Block 427. If test 443 indicates that 

The control map is altered under the control of the all outputs have been written, then the work for this frame 

program of the microprocessor when the microprocessor 60 is finished, and the processor goes back to Block 401 to wait 

receives a control message from connect request register 351 for the next frame synchronization pulse, 

within the control register 124 of FIG. 1, the control message The above flow chart provides double buffering for all 

representing a request to establish or disconnect a connec- time-slots whether they represent nx64 kilobits per second 

tion in the time-slot interchange unit. The process of con- signals such as 256 kilobit data, or a single 64 kilobit per 

trolling the control map is well known in the prior art. 65 second voice or data time-slot. If the additional frame delay 

FIG. 4 is a flow diagram describing the operation of the introduced by the double buffering is not desired for the 

program for implementing a time-slot interchange (TSI) in single 64 kbit/sec voice or data time-slot, then the flow chart 
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can be modified to provide selective double buffering, i.e., 
the single voice or data time -slot is not double buffered. 
Such single buffered time-slots are marked in the control 
map 311, which causes the time-slot to be read from the 
other one of the two TSI buffer frames 301 and 303, by 5 
negating the effect of the double buffer offset. Thus, single 
buffered time-slots may be read out of the opposite frame 
from the double buffered time-slots. 
Generalized TSI Flow 

Hie flow chart shown in FIG. 4 is transversed only once 10 
per frame because each of the serial input streams was 
assumed to consist of 32 time-slots, which in the present 
implementation is written into the microprocessor cache in 
a single 32 byte burst, as described when Action Block 406 
was discussed. Relatively simple modification of FIG. 4 is 15 
required and illustrated in FIG. 5, in order to accommodate 
higher bandwidth serial links: 

(1) Another decision state 451 is required after the ''Yes" 
output of decision state 443 in FIG. 4. This determines 
whether the entire frame of time-slots has been pro- 20 
cessed. If "Yes", we return to the wait state of Block 
401, If "No", we return to the wait for input buffer 
loaded block 404 for the next burst of 32 time-slots. 

(2) The initialize read control map pointer, Action block 
425, is moved out of the TSI read loop to the beginning 25 
of the TSI write cycle, (after the initialized memory 
write address, Action Block 402), since the entire frame 
has not yet been written. 

The RISC microprocessor hardware of FIG. 1, the block 
diagram of FIG. 2, and the programmer data model of FIG. 30 
3, can also be used for implementing a Time Multiplexed 
Switch, (TMS). The basic difference is that a TSI application 
requires the storing and maintaining in memory of either one 
or two frames of time-slots, (single or double buffered 
applications), whereas a TMS application requires the 35 
switching of the time-slots as soon as possible after they 
appear at the input to the TMS. This means that after the 
serial input streams appearing at 101, . . . , 102, (which have 
been written into the TSI buffer of FIG. 3), are read out to 
the serial output streams 111, . . . , 112, their storage in the 40 
TSI buffer is 00 longer necessary. Therefore, subsequent 
write bursts into this buffer during the frame interval can 
overwrite the old data. This means that less memory is 
required for the TMS application than the TSI application, 
since only 32 bytes, (the write/burst size), per serial input are 45 
required rather than one or two frames of memory. Also, 
double buffering is not required for nx64 kbits per second, 
because the time -slots are read out immediately, and thus, 
there is no possibility of the time-slots getting out of 
sequence. 50 

FIG. 6 is a flow chart for implementing a TMS. It is 
similar to the TSI basic flow chart (FIG. 4), and incorporates 
the changes described earlier for generalized TSI flow, as 
well as the differences described above for a TMS. To help 
the reader, the same action is given the same number as in 55 
FIG. 4. For a TMS, high bandwidth facilities, much larger 
than the 2.048 Mbits per second assumed for the basic TSI 
flow chart, are required. This requires the addition of test 
449 in FIG. 6 in order to handle the entire frame, and moving 
the initialize read control pointer, (Action Block 425), from eo 
the TSI read cycle to the frame initialization portion near the 
beginning of TMS write cycle of FIG. 6. These two steps are 
the same as those described for a generalized TSI flow. 

To implement TMS functionality, the only two changes to 
the flow chart are: 65 

(1) Move Action Block 402 from the frame initialization 
portion of the TSI write to the buffer loaded inner loop, 



,741 Bl 

8 

so it can overwrite the previous burst, since as 
described in the previous paragraph, this data has 
already been output; and 
(2) eliminate Action Block 403, which is used to imple- 
ment double buffering. The TMS flow chart of FIG. 6 
implements the time multiplexed switching function. 
A variation on writing the input buffers 101, . . . , 102, 
sequentially into cache, is that instead of taking a 32 byte 
burst from a single input buffer, 8 bytes from each of four 
input buffers are written. This has the advantage of reducing 
the number of bytes of buffering required by input buffers 
101, . . . , 102, from 32 bytes to 8 bytes per buffer. Taking 
16 bytes from each of two buffers can also be implemented. 

FIG. 7 is a flow diagram illustrating the operation of the 
system when used for switching groups of time-slots at a 
time. This use would be for replacement of a digital cross- 
connect (XCON), such as the DACS (Digital Access And 
Cross- Connect System), systems manufactured by Lucent 
Technologies. 

Blocks 461, 463, 465, and 467, replace the functions 
carried out by Blocks 429, 431, and 433, in FIG. 6. In the 
implementation described in FIG. 7, only Blocks 461 and 
463 are repeated 8 times. In the final repetition, 465 and 467 
are shown, but instead of using the loop, the program is 
written in-line. Action Block 461 is essentially equivalent to 
Action Block 429 of FIG. 6, and Action Block 463 is 
essentially equivalent to Action Block 431 of FIG. 6; 
however, instead of having a test of 433, the code is simply 
repeated 8 times prior to entering Action Block 441. 

The above flow chart described an eight bit time-slot 
where a byte quantity is read arid written in Action Blocks 
429 and 431. Sixteen and 32 bit time-slots can easily be 
accommodated with a straightforward substitution of half 
word or full word microprocessor instructions, for the cor- 
responding load and store byte instructions. The time-slot 
width can be further generalized to include group switching, 
where contiguous time-slots are switched as a group using 
load/store string instructions in Action Blocks 429 and 431, 
to transfer a sequence of time-slots. The total number of 
bytes of switched information per unit of time, increase with 
increasing time-slot width or group size, since the loop 
overhead of Action Blocks 427 through 433 is reduced 
proportionally relative to that of a byte wide time-slot. This 
is very efficient for switching a 32 time-slot PCM (El) 
facility, for implementing a cross-connect. Some group sizes 
like that of a Tl facility of 24 byte wide groups, might be 
most efficiently switched by padding the 24 time-slots to a 
32 byte group. Groups can be concatenated contiguously to 
form higher bandwidth rates, such as DS3 at the output of 
the output buffers; this is especially useful for performing 
the function of a digital access and cross-connect system. 

The block diagram of FIG. 1 can also be used to imple- 
ment an ATM Switch. FIG. 8 shows the structure of an ATM 
cell header. The ATM cell might be most efficiently switched 
by padding the 53 time -slots into a 64 byte group. This 
requires some control logic in the Input Buffers and Output 
Buffers. The generic flow control bits 5-8 of octet 1 are used 
for overall control to prevent an ATM system from being 
overloaded. The virtual path identifier is split across the first 
four bits of the first octet, and the last four bits of the second 
octet. The virtual path identifier identifies a user. All virtual 
channels of the same user use the same virtual path identi- 
fication. The virtual path identifier is the primary identifier 
used for switching ATM cells within a switch, and for 
identifying incoming AIM cells so that they can be switched 
to the appropriate destination. The virtual channel identifier, 
(the first four bits of octet 2, all of octet 3, and the last four 



04/26/2004, EAST Version: 1.4.1 



US 6,600, 

9 

bits of octet 4), is used by the user to identify a specific 
communication among a plurality of the communications 
between the end users; the specific communication resides 
on a specific channel. The first four bits of octet 4 are the 
payload type (2 bits), one bit reserved for future use, and a 5 
cell loss priority bit The cell loss priority bit is used to help 
determine whether a particular cell may be discarded in case 
of overload. Finally, the header error control octet is a cyclic 
redundancy check (CRC) over the header. 

FIG. 9 is a functional overview of the software control 10 
components of the ATM Switch. They consist of a CRC 
Check, Input Link Control, VPI/VCI Processing, Shaping, 
Quality of Service (QOS) Processing, Output Link Control, 
and CRC Generate. The cyclic redundancy check, (Action 15 
Block 901), is performed on the header of each ATM cell as 
it enters the system. Input link control, (Action Block 903), 
brings incoming data into the memory of the microproces- 
sor. VPI/VCI processing, (Action Block 905), finds a VPI/ 
VCI data block which contains an Input VPI/VCI indication, 20 
an Output VPI/VCI indication, and a Quality of Service, 
(QOS), pointer. Test 907 is used to determine whether a 
shaping test is necessary. Shaping tests are not performed on 
every cell, but typically, on every tenth cell. If this is a cell 
that requires the performance of the shaping function, this 25 
shaping function is executed, (Action Block 909). The 
shaping function determines whether the peak or the average 
allowed data rate is being exceeded. If so, the shaping 
function introduces a throttle to the' transfer of information, 
which is regulated by putting packets into a shaping queue 30 
with limited size, so that if the peak rate is exceeded for too 
long a time, or the average rate is exceeded, there would no 
more space in the shaping queue, and the input would be 
throttled, or packets would be dropped. 

Next, Quality of Service processing, (Action Block 911), 35 
is carried out. Each output link has a plurality of queues to 
provide cells to that output link. The queues contain infor- 
mation of different priority so that certain queues are served 
preferentially compared to other queues. Finally, the output 
link control, (Action Block 913), transmits cells from one of 40 
the QOS queues to an output link, and a new CRC is 
generated. Prior to inserting the cell into one of the QOS 
links, the output VPI/VCI is inserted into the cell header. For 
some implementations, the CRC functions can be done in 
hardware in order to increase the switching capacity of the 45 
ATM Switch. 

FIG. 10 shows the Programmers' Data Model including 
register assignments and the data structures used in the 
implementation. ATM cell routing, defined by the Virtual 
Path (VP), and Virtual Channel (VC), identifiers is imple- so 
mented by table look-up in an off chip Static Random Access 
Memory (SRAM), or in a Level 2 cache, using a hashing 
algorithm. Queuing of cells is implemented by means of a 
shared buffer area in cache memory and linked lists asso- 
ciated with each of the output ports. There is also a linked 55 
list associated with the unused memory locations, which is 
used as a pool for adding members/locations to any of the 
linked lists. Each output link has multiple output queues, 
each of which is associated with a specific Quality Of 
Service (QOS). Each output link uses a table look-up of 60 
priority, to give the identity of the next QOS queue to be 
output. This allows the QOS queues to be accessed in any 
priority sequence desired. 

While in this preferred embodiment, everything is in the 
cache, for other implementations, especially those with high 65 
throughput, much of the data, and some of the more spe- 
cialized programs, can reside in an external memory. 
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The function of the various Blocks of FIG. 10 is as 
follows: 

Block 1001 represents the input buffers to the switch. 

The input buffer address register, 1003, determines which 
buffer the system is processing. 

Cell header address register 1005, and cell header register 
1007, are used for processing the header of one par- 
ticular cell. 

Block 1009 is used for checking and generating the 
header CRC, (in some alternate configurations, the 
CRC can be checked or generated automatically by 
circuitry). 

Blocks 1011, the hashing function register, and 1013, the 
hashing product register, are used for locating the 
VPI/VCI specified in the header of an input cell. 

Block 1015 is the VPI/VCI Table, which is typically 
occupied only 50 percent to allow for efficient hashed 
access. 

Some of the blocks pointed to by Table 1015, are Block 
1017, which is the VPI/VCI block for VPI/VCI 1, 
Blocks 1019, which are empty blocks, and Block 1023, 
the block for the last VPI/VCI. 
Block 1017 includes the identity of the input VPI/VCI, the 
identity of the output VPI/VCI to which the cell should 
be switched, and a pointer to the Quality of Service, 
(QOS), queue which is used for assembling cells to be 
transmitted to the output link. 
The third column of FIG. 10 shows a plurality of QOS 
queues, using a shared memory spectrum, one set 1031, . . . , 
1033 for link 1, and another set, 1035, . . . , 1037, for the last 
link, link "n". Block 1031 includes an identification of the 
link for which cells are being queued, and a pair of pointers 
for the entries in the queue. The entries in the queue are 
linked each to the next, and the head cell pointer is used to 
find the cell in the queue which is to be transmitted to the 
output link, and the tail cell pointer finds the entry in the 
queue in which the next cell can be entered. Finally, Blocks 
1041 and 1043 are used for selecting the particular cell in 
one of the QOS queues which is to be transmitted to an 
output buffer. For each output buffer, there is one link 
control, such as link control 1043. Link control 1043 con- 
tains head cell pointers to the QOS queues. For high priority 
QOS queues, several entries would be made in the table of 
1043, which has sixteen entries, with the sixteen entries 
being more than the typically 4, QOS queues per output 
buffer. The output link register is used to select which link 
is being processed, and the priority counter register is used 
to select the head cell pointer for that output buffer. When 
the head cell pointer of Block 1043 is read, it will point to 
a head cell pointer of one of the QOS queues, and that head 
cell pointer in turn, will point to the oldest cell in that queue, 
i.e., the cell which is to placed in the output buffer. Finally, 
Block 1051 shows the "n" output buffers, output buffer 1, 
1053, . . . , output buffer "n", 1055. The output address 
register 1057 is used to select which output buffer is being 
processed. 

FIG. 11 is a flow chart showing the Cell Input and 
VPI/VCI Flow. The cell input section shows the writing of 
a burst of 32 bytes into the cache memory from the Input 
Buffer selected by the Input Buffer Address. Hie header and 
VPI/VCI Processing are shown in the remaining part of the 
figure. The CRC check can be done in software if desired, 
and is implemented by using the header, a byte at a time, for 
indexing into a 256 byte table. If an error is detected, a 
routine is entered which either corrects the error or results in 
the cell being dropped. After the CRC check, an Empty Cell 
Code Check is done. Empty cells are ignored, but the routine 
goes to the normal "single thread" output routine, ("E" input 
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of FIG. 13). Next, a 32 bit hashing function is used in 
conjunction with the VPI/VCI, to generate a hashing address 
for indexing into SRAM or Level 2 cache, and read a 32 byte 
burst of data for that VPI/VCI. If the correct VPI/VCI is not 
at that address, alternative hashing addresses are iteratively 
tried until either the correct VPI/VCI is found, or the 
exception handling routine, is entered. Hashing algorithms 
are well described in the literature. For a VPI/VCI table 
which is only 50% occupied, the average number of searches 
required by the implemented algorithm is 1.5, thus, provid- 
ing reasonable access times at the expense of memory. When 
the search is successfully completed, shaping is performed, 
if necessary, and the "Output VPI/VCI", i.e., the destination 
for the cell, is extracted from the table and inserted into the 
cell header. 

FIG. 12 is the Output Queue Flow Chart. It consists of 
inserting the cell into the appropriate output queue based on 
the output link, and the QOS specified in the data associated 
with the VPI/VCI search described in the previous para- 



of the synchronized loading of the input buffers 101, . . . , 
102, (FIG. 1), in serial input streams. Action Block 1103 
indicates a wait for the signal that input buffer has been 
loaded. When the input buffer has been loaded, the memory 
5 write address for unloading that buffer into microprocessor 
memory is initialized. The cell is then read from the input 
buffer, (Action Block 1107), and the input buffer address is 
incremented, (Action Block 1109). At this point, the cell has 
been loaded into the memory of the microprocessor, and the 
10 microprocessor is ready to process the cell. The header of the 
cell is loaded into a register, (Action Block 1121), and a 
CRC check is performed, (Action Block 1123). A CRC 
check is performed only on the contents of the header. A 
CRC check can be performed with special circuitry, or it can 
15 be performed relatively expeditiously through the use of a 
table of 256 bytes; each byte corresponding to one of the 256 
possible CRC bytes. Next, a check is made to see if the cell 
is empty, (Test 1125). An empty cell has an industry standard 
predetermined VPI/VCI identification. Test 1127 determines 



graph. There are "m" QOS queues associated with each 20 whether the cell is in fact empty, and if so, further processing 



output link, and each queue is defined by a linked list, (see 
the "m" QOS Queues Per Output Link Tables in FIG. 10). 
Linked lists are well known in the prior art There is also a 
list of all the unused memory locations defined by an unused 
locations link list, called an "Unused Location Queue", 
(ULQ). FIG, 12 details the pointer and data manipulation to 
implement the linked fist queues. 

FIG. 13 is the Write to Output Links flow chart. The 
priority sequence used for the output queues is to use a static 



is terminated, and the output processing routine of FIG. 13 
is entered. If the cell is not empty, then the VPI/VCI Table 
entry for this cell must be found. Action Blocks and Tests 
1129, 1131, 1133, 1135, 1137, 1139, 1141, 1143, and 1145 
25 describe this process. The VPI/VCI Table, (Table 1015), of 
FIG. 10 is found, (Action Block 1129). A hashing function, 
a known constant, is then loaded into a^ register of the 
microprocessor, (Action Block 1131). This register is then 
multiplied by the contents of a register containing the 



Per Output Link Priority Table, (see FIG. 10), to establish 30 VPI/VCI, (Action Block 1133). In one example of this 

the sequence of queue readout on a per link basis. The Per embodiment, there are up to approximately 2,000 VPI/VCI 

Output Link Priority Tables shown in FIG. 10, show, (as an entries, such as Block 1017 of FIG. 10. In the Table, 12 bits 

example), 16 entries, each of which could specify any of the of the product generated in Action Block 1133, the least 

"m", (e.g., m-4), queues established for that link. If the significant 12 bits in this case, are then used to read an entry 

selected queue on a link is empty, each of the other queues 35 in the VPI/VCI Table. The Table is 4,096 entries long, and 

are interrogated until a queue with data is found, or it is corresponds to the 12 bit accessing queue. In Action Block 

determined that all of the queues associated with the fink are 1137, the actual VPI/VCI is compared with the VPI/VCI 

empty. If a cell is present in any of the queues, then the CRC found in the accessed VPI/VCI Table, (Action Block 1137), 

is generated and inserted in the header and the cell is if Test 1139 is used to determine if the two are equal, 

transferred to the output buffer. If there is no cell in any of 40 Equality means that the appropriate VPI/VCI Table entry has 

the queues, then the CRC for Idle Code is generated, and an been found. If not, then Test 1141 is used to determine 

Idle Code cell is transferred to the output buffer. There is whether this is already the "nth try", and if so, the exception 

then some pointer manipulation associated with housekeep- handling routine 1143 is entered. This routine searches a list 

ing of the linked lists. There is further housekeeping asso- of VPI/VCI Table entries, (not shown in an Auxiliary Table), 

ciated with priority and buffer address manipulation. There 45 used for serving cases in which "n" tries fail to locate a 

are also some decision points regarding All Links Written, VPI/VCI. Entries in the Table are created in those cases 

Shaping, and All Cells Read resulting in appropriate loop where an attempt to load the Table, encounters "n" failures, 

back to entry points in FIG. 11, or transfer to the shaping If this is not the "nth try", then a different 12 bits of a 32-bit 

routine, product generated in Action Block 1133 is used, (Action 

Shaping, (Action Block 909), occurs at multiple, periodic 50 Block 1145), in order to access a different entry of the 

cell intervals to assure that the per VPI/VCI contracted peak VPI/VCI Table, (Action Block 1135). 
and average bandwidths are not being exceeded. Cells can The hashing arrangement is used because the total number 

either be dropped, delayed or passed through. Shaping is of possible VPI/VCI combinations is over a million, (the 

done on a per VPI/VCI basis using linked list auxiliary VPI indicator is 8 bits long, and the VCI indicator is 12 bits 

queues. The details for performing shaping are well known 55 long), so that 2 20 , (more than one million), possible values 



in the prior art. Additional information is stored in the 
VPI/VCI Table of FIG. 9. For the shaping interval being 
considered, (e.g., every 10 cells for peak rate, and every 100 
cells for sustained rate), the following information is pro- 
vided in the VPI/VCI table: contracted Peak Cell Rate, 
(PCR), time stamp for PCR, contracted Sustained Cell Rate, 
(SCR), time stamp for SCR, and maximum size of shaping 
queue. 

The individual Blocks of FIGS. 11-13 will now be 



of VPI/VCI exist even though only 2,000 are being used at 
any one time. 

Once the appropriate VPI/VCI Table entry has been 
found, (with output of Test 1139), Test 1151 is used to 
determine whether shaping is required in this case. In this 
embodiment, shaping actions are performed only on every 
"nth w cell, wherein "n" may, for example, have a value of 10. 
Shaping is used to monitor the input rate of a particular VPI, 
to insure that the VPI does not send more cells than is 



described. FIG. 11 starts in Block 1101, wherein the system 65 allowed for its peak rate. The peak rate is defined as the 
is waiting for a frame synchronization pulse. When the number of cells which may be sent for a particular interval, 
frame synchronization pulse arrives, it signals the beginning If more than this number of cells is sent, then the extra cells 
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are either simply discarded, or are temporarily passed on, The queue pointed to by the priority table, is then checked 

but a slow-down message is sent to the source of the cells. to see if it is empty, (Action Block 1305). Test 1307 is used 

After the shaping function has been performed, (Action to determine whether the queue is empty, and if so, whether 

Block 1153), or in case shaping is not required for this cell, this is the last, (4th), queue, (Action Block 1309). If it is not, 

then the output VPI/VCI identifier is loaded from the VPI/ 5 111611 the queue counter is decremented, (Action Block 1311), 

VCI Table into the cell and is substituted for the input and the corresponding queue is checked to see if it is empty, 

VPI/VCI. Thereafter, the output queue routine of FIG. 12 is (Ac ti ? n Blo <* U J* 551 * ° f ^ 13< ? 7 > 

entered initially, or after having gone through the loop, using 1309, 

The system reads the QOS pointer stored in the VPI/VCI 13 "> ^ "05, indicates that the queue is not empty then 

ni , i. . . ii - * ji_ a CRC is generated for the cell header, (Action Block 1313), 

Block. This pointer points to, j iaA ceU pomter within the 10 ^ ^ ^ ^ ^ ^ m ^ ^ (Aq J q 

QOS queue for serving tha ^VPIyVCL The QOS queue (for Block m5) ^ buffef iQcreme nted to 

example, Block 1037 of FIG. 10) is used to queue cells for for i DseqU ent processing, (Action Block 1317), and 

transmission to an output link. As previously mentioned, ±e qucuc mis ^ was transferred to the output buffer, is 

several QOS queues serve a particular output link and updated to add the storage of the ceU which was transferred 

depending on the quality of service being supplied to a 15 to the buffer to the list of empty locations in the queue, and 

particular VPI/VCI, the cells are stored in a different queue, to update the head cell for the queue. 

and different QOS queues are served preferentially, for Action Blocks 1321 to 1325 represent link list pointer 

delivering their contents to an output link. The contents manipulation for reading from a queue to the output link, 

within each QOS queue are stored in a linked fashion, and and is similar to the write sequence for Action Blocks 1204, 

the last entry is pointed to by a tail cell pointer. It is this 20 1211, and 1213, described above. In this case, however, a 

pointer which is pointed to by the QOS pointer in the cell location is added to the ULQ pool, and a cell location 

VPI/VCI Block. The QOS queue "m" pointer is read, is removed from the queue that has out-putted a cell. 

(Action Block 1203), and a "n" link from that idle queue Test 1335 is then used to determine whether outputs to all 

location to the next idle queue location, is temporarily stored links have been sent. If not, the output link priority table is 

in a register of the microprocessor, (Action Block 1205). 25 incremented, (Action Block 1337), so that at the next pass, 

The cell is then stored in the queue at the location originally the next link will be served. Action Blocks 1339 and 1341 

specified by the "n" cell pointer, (Action Block 1207), and are used to unload the shaping queue. In the case that outputs 

the address of the next empty cell. to all links have been generated, (positive result of Test 

In order to share the available memory space effectively 1335), then the output link priority counter is incremented, 

and dynamically, linked lists are used for each of the output 30 1351, the input buffer address is initialized, (Action Block 

queues. In addition, there is an "unused location" linked list, 1353), so that the first input buffer is then serviced, the 

which is a global resource containing the empty, (unused), output buffer address is initialized, (Action Block 1355), so 

locations available for storing information in queues. When that at the next pass, the initial output buffer will be serviced, 

a queue needs to add information, it gets the available and the output link address register is initialized (Action 

location from the "unused location" linked list. As a result, 35 Block 1357). Test 1359 then determines whether all cell 

both the "unused location" b'nked list, and the linked list of have been read from the input buffers, and if not, Action 

the queue requesting an available location, are impacted. Block 1103 of FIG. 11, is re-entered. If all cells have been 

There is a separate head cell pointer, and tail cell pointer read, then Action Block 1101 of FIG. 11 is entered, 

associated with every queue, including the Unused Location FIGS. 11 to 13 show the flow charts for implementing an 

Queue, (ULQ). 40 ATM switch, exclusive of the shaping, (which occurs only at 

The head cell of the ULQ is the next available location for multiple cell intervals to assure that the per VPI/VCI con- 
storing a queued cell, and the tail cell of the ULQ is the last traded peak and average bandwidths are not being 
cell that has been returned to the ULQ pool. The head cell exceeded). The flow chart deliberately shows a "single 
of a queue is the last cell that has been stored in that queue, thread" implementation, i.e., one cell at a time is taken from 
and the tail cell of a queue is the next cell to be out-putted 45 input to output before the next cell is input in order to 
from that queue. The head cell of the ULQ becomes the tail demonstrate the logic of the design. Efficiencies in processor 
cell of the queue that is requesting a storage location, and the utilization can be obtained by overlapping functions such as 
linked lists of both are modified to support this transfer of I/O read/writes, and read/writes of off chip memory and 
memory location function. Specifically, Action Block 1204 Level 2 caches, by doing "multiple thread" ATM cell pro- 
extends the queue to include the cell stored by Action Block 50 cessing. 

1207, and Action Block 1211 updates the pointer to reflect The above implementation of ATM switching assumed 
this linked list extension. Action Block 1213 changes the that the ATM cells coming into the switch were in the format 
head cell pointer of the ULQ to reflect the removal of an of 53 contiguous time-slots, which characterizes an impor- 
available cell location. tant segment of the applications. There are other applications 
Following the execution of Action Block 1213, the output 55 where an ATM cell comes in over lower bandwidth pipes, 
processing of FIG. 13 is performed. Block 1043 of FIG. 10, e.g., fractional Tl /El using 128 Kbps, 384 Kbps, etc. For 
is a series of 16 pointers to the "m" QOS queues of a those cases, the ATM cell needs to be aggregated by exam- 
particular output link, where "m", in this example, is much ining a number of frames until the entire 53 byte cell is 
less than 16, typically 4, so that the 16 entries can be used available. There are several ways to implement this. One 
to service different QOS queues more, or less, frequently. 60 way is to consider this function as part of the periphery, and 
Associated with an output queue is a priority counter 1045, provide a separate RISC microprocessor to provide the 
which is used to select the appropriate entry from the priority function. A second way, is to incorporate the aggregation 
table. In Action Block 1301, the priority counter is used to function into the ATM switching fabric discussed above, 
index into the priority table of the output link being serviced, Different tradeoffs will exist for different applications, e.g., 
(different output links are serviced on a rotating schedule). 65 the ratio of fractional ATM to complete cell ATM, as well of 
The priority counter is then incremented in order to prepare the size of the switching job being considered and the 
for servicing the link the next time, (Action Block 1303). amount of real time available. 
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The block diagram of FIG. 1 can be used to implement an Frame relay switching can also be implemented within a 

Internet Protocol, (IP) switch, as well as an ATM switch, software based universal switching fabric. In the frame relay 

whose functionality is described in FIGS. 9-13. Unlike the case, HDLC based processing would be best performed by 

ATM case, an IP packet is of variable length, and has a input adaption hardware, because the bit oriented processing 

destination address field that requires a longest prefix match 5 WO uld often not be cost effective in universal switch soft- 

for switching. The variable length implies more flexible ware. Assuming that well formed frames were handed to the 

buffer allocation schemes, and potentially, requires packet frame switching software, hash searching over DLCI field 

fragmentation and reassembly, depending upon the maxi- information would yield out-put link and queue information, 

mum transmission unit sizes m the different networks that Separate Operations, Administration and Maintenance, 

the IP switch would switch between^The sequence of w {0Am softwm woM m ^ { ^ ihc f Tmc toniing Moi . 

mati0D cmbodied * thc framc hash ™ te tabfc - Subsequent 

consist of header checksum verification, input link control, A t . t r r .. n 4 t 

destination processing, quality of service processing, output output processing would unlink the frame from its ou pu 

fink control, and header checksum verification. In some <J ueue for transmission within an HDLC format by output 

implementations, the header checksum processing could be adaption hardware. 

done in hardware in order to improve the capacity of the IP 15 So far > ^is document has described examples of single 

switch. function switching fabric implementations which can reside 

After IP header checksum verification, IP packet routing on, and be implemented by a common RISC microprocessor 

examines the destination address field of the IP header, and architecture. These single function switching fabrics can 

performs a hash based look-up algorithm that can search for reside, and be implemented concurrently on the same micro- 

a longest prefix match as is well described in the literature. 20 processor. 

The search would return information about the appropriate In its simplest form, the different type of fabric function- 
output link. Further analysis of the packet header could yield ality can be allocated on a per serial link interface to the shift 
treatment information for implementing various levels of registers shown in FIG. 1. This would be done under the 
quality of service, and would locate a specific output queue control of software than can be downloaded, as required. For 
associated with the output link and the assigned treatment ^ each type of lirik, the program for processmg the protocol of 
quality. If the packet length is larger than the maximum ma t lime is executed when processmg mat link. For example, 
transmission unit size of the output link, then the packet ^ time-slots and circuit switched time-slots destined 
would be fragmented and linked to the appropriate output to TSI functionality occupied separate serial link interfaces, 
queue as a sequence of packets. Output link processing ^ would ^ ^ cache ^ described 
would selec ^a packet from the . highest priory queue of the 3Q implementations. The bandwidth of 
moment, and perform final adjustments to the selected IP . . ~ . , r . f , 4 , , 
header such as adjusting the time to five field and the header * ese f nal ^ °J J?™^ ^ ^ 
checksum of the modified IP header before committing the depending on the application and the specific serial link, 
packet to the actual physical output link. A time to five field Smce 151 time-slots must be retained for one or two frame 
is used to discard an Internet packet if it is not delivered intervals, (depending on whether the time-slots are single 
within the time, or the number of switching points traversed, 35 buffered or double buffered), reading in of subsequent ATM 
specified in the field. cells which do not have this frame retention requirement, 
IP, (Internet Protocol), switching can be performed within could result in the corruption of the TSI data in the cache. If 
an universal switching fabric via software emulation of cache lines are locked after each input burst until the data is 
functionality, that would in less flexible implementations, be no longer required, then this potential problem is avoided, 
performed in hardware, often in Field Programmable Gate 40 This can be extended to more than two concurrent fabric 
Array, (FPGA) based state machines. In all types, including positional switching, (e.g., TSI, TMS, and 
implementations, well formed packets are eventually handed XCON), and packet switching, (e.g., ATM, IP routing, and 
to switching and routing software. The headers of these Frame Relay). Allocation to individual serial links may be 
packets would be exarnined for classification as to flow unnecessarily restrictive, for many applications and the 
types via hashing to determine output queues. The flow 45 different types of traffic can reside on the same serial link 
classification could use various protocol and port data from with specific chunks of bandwidth being allocated for each 
the packet to be switched in addition to the IP destination, in protocol type, for switching data being transmitted in dif- 
forming keys to the hashing process. The hashing search ferent protocols. This could also be done by downloading 
ultimately yields output link and queue information allow- the appropriate data or software. This could be done using a 
ing for, (Quality of Service), QOS treatment. Various IP 50 "recent change" mechanism, as customers select or change 
fields such as (Time to Live), TTL, would be updated as the their service type. 

packet was linked to output queuing. The routing informa- In the descriptions for TSI interfaces, the description 

tion embodied in the flow based hashed search table, would indicated that the 24 bytes could be padded to 32 bytes in the 

be maintained through gateway protocol processing. Output input/output shift registers. Similarly, for ATM interfaces, it 

handling would on a per link basis, always determine the 55 was suggested that the 53 byte cell could be padded to 64 

next best output queue, to unlink a packet from for actual bytes in the input/output shift registers. Although this is 

packet transmission. As is described in the above packet reasonable to do when only a single functional type is 

formation case, the packet output case could also be embod- allocated to a shift register, it may add too much complexity 

ied in several different implementations. IP switching uses when multiple functional types are allocated to a specific 

many of the mechanisms described in more detail in the 60 shift register. Thus, it may be preferable to read, (burst in), 

ATM section. Depending upon performance trade-offs, dif- or write, (burst out), the links to/from cache as they are, i.e., 

ferent embodiments of these concepts can move the func- a contiguous stream of time-slots, and do the padding 

tionality of packet formation from serial streams; various manipulation in software, inside the microprocessor, 

separate sequentially cooperating processors can be used Bandwidth allocated for different traffic types within a 

instead of a single processor to form packets from within 65 given serial link could be flexibly manipulated by the subject 

TSI time-slot locations marked as containing packet stream microprocessor, with a linked list of data structures being 

data. used to describe sequential memory bytes from each serial 
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interface. Separate input and output lists for each interface 
could be interpreted by the microprocessor with descriptive 
codes indicating the traffic type, with length information and 
application specific points and indicators that could, for 
instance, indicate where circuit switched data belong within 
a TSI, or where packet data would be buffered, for reassem- 
bly. The switch retains control data for a frame. 

For example, the microprocessor might interpret from the 
linked data structures for a given interface, that the next M 
bytes of data should be treated as circuit switched data to be 
sent to the next M sequential locations of a TSI. The next 
linked data structure might then contain a code and length 
indicating that the next N bytes contain a part of an IP packet 
that is being assembled at the reassembly area, pointed to 
from the data structure. Finally, for example, the last linked 
data structure might indicate that the next P sequential bytes 
contain ATM cells. 

Such a linked list of input and output descriptor data 
structures could flexibly describe any variety of traffic types 
within input and output interfaces. The descriptors could 
also indicate how data should be interpreted within distinct 
virtual tributaries of the same physical interface. OA&M 
software would be used to maintain the contents of the 
descriptor data structures. 

Advantageously, these concepts can be used, for example, 
within a single microprocessor universal switch application 
at a small business, where in the prior art, the small business 
would lease separate fractional Tl facilities, with one Tl 
facility for PCM circuit switched voice traffic, another for 
frame relay based IP world wide web traffic, and yet a third 
Tl facility, for ATM based video conferencing. The leased 
cost of these separate facilities would often be substantially 
more than the cost of a single facility, even when more 
bandwidth would be available if a universal switching 
element could be used. The universal switching element can 
also, advantageously, offer the dynamic adjustment of band- 
width between different traffic types within the consolidated 
leased Tl facility. 

The result of putting multiple concurrently running 
switching fabrics in a single microprocessor, is to have a 
minor impact on the capacity of the switches because of the 
real time impacts described above. It is estimated that a 300 
MHz EC 603e PowerPC can support about 480 Mb/sec, 
(7500 time -slots), of single time-slot TSI switching, or about 
1.5 Gbit of ATM cell switching, (3 million cells per second). 
When sharing functionality on a single microprocessor, the 
capacity of each of the applications is reduced by the 
proportional amount of their real time usage. For example, 
a single microprocessor could concurrently support about 
240 Mb/sec, (3750 time-slots), of TSI, and 750 Mb/sec of 
ATM cell switching, (750,000 cells per second). The ratios 
for a particular application would depend on the traffic mix, 
and could include proportional amounts of Frame Relay and 
IP router switching. 

The above sections have demonstrated concurrent opera- 
tion of circuit switching and packet switching fabrics. The 
RISC can also provide the SAC function required to go 
between the circuit and the packet worlds such as AAL1, 
AAL2, and AAL5, as well as the layering of IP over ATM, 
and IP over frame relay. Thus, not only is there connectivity 
within the each of the switching domains, but also integrated 
interconnectivity between these switching domains. 

FIG. 14 illustrates an arrangement for increasing the size 
of the TSI of FIG. 1. FIG. 14 shows an implementation that 
can be applied to any number n of input signals, any number 
k of microprocessor complexes, and any number n/k, that 
can be accommodated by the speed and memory capacity of 
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these complexes. In the specific embodiment of FIG. 14, n 
is 32, k is 8, and n/k is 4. Each of the input streams 
terminated at the buffer amplifiers 521-1, . . . , 521-32, is 
connected to a shift register input buffer similar to the input 

5 buffer 101. For microprocessor complex 501-1, shift regis- 
ters 511-1, . . . , 511-32, are connected to local bus 541-1, 
from which microprocessor complex 501-1 accept inputs. 
The same arrangement is available for each of the 7 other 
microprocessor complexes 501-2, . . . , 501-8. Each micro- 

10 processor complex feeds only four of the total 32 output 
buffers. For example, microprocessor complex 501, feeds 
output buffers 531-1, . . . , 531-4. The capacity of each 
microprocessor complex must be adequate to take inputs 
from the full range of input shift registers, but need only 

15 drive 1 over k of the output streams. Fortunately, the 
absorption of the inputs is done in parallel, since input 
signals are loaded into sequential locations in the TSI buffer 
301, 303, of each microprocessor. Thus, very large amounts 
of input data can be absorbed per unit time in the micro- 

20 processor caches. It is only the output data which requires 
sequential time-slot by time-slot, or group by group, pro- 
cessing by the microprocessor. 

The arrangement of local shift registers per microproces- 
sor complex has the advantage of limiting high bandwidth 

25 connections to the nearby locality of each microprocessor, 
with the corresponding disadvantage of requiring replicated 
shift registers for each microprocessor. In another arrange- 
ment that might sometimes be advantageous, a single global 
set of shift registers could be used, with each microprocessor 

30 in lockstep, absorbing the same input data at the same time. 
In this case, the complexity of high bandwidth global 
connections and global microprocessor synchronization, 
would be traded against the savings of a set of shift registers 
for all but one of the microprocessors. 

35 Theoretically, it is possible to take input data and process 
the input data serially in order to generate pre-ordered output 
data. The arrangement of FIG. 14 does not work satisfac- 
torily for that kind of arrangement, (processing inputs seri- 
ally to generate parallel outputs), because for each input 

40 word that is received in parallel, different microprocessors 
are required to do different amounts of processing, since 
each processor may process a different number of bytes to 
generate output streams for its outputs. 
FIG. 15 shows a 3 stage Clos Network of switching 

45 microprocessors. The attribute of this type of network is that 
it requires the minimum number of time-slots, ("cross- 
points" in earlier technologies), to provide a non-blocking 
network. For "n" inputs to the Input Microprocessors in FIG. 
14, 2n-l outputs are required for non-blocking. For large 

50 number of time-slots, 2n is a very close approximation, and 
errs on the conservative side. Consider FIG. 15 for a large 
TSI implementation. The input and output microprocessors 
have symmetrical capabilities. The total capacity of this TSI 
is "k" times that of a single microprocessor. There is a 

55 Control RISC Microprocessor that receives control mes- 
sages from an external source, which has defined the end 
point time-slot connections required. The Control Micro- 
processor figures out the appropriate paths through the Clos 
Network, and dispatches appropriate control messages to 

60 each of the microprocessors requiring this information. This 
is done only once for each call. Each of the Input Micro- 
processors needs to assemble links of time-slots to send over 
to the appropriate Output Microprocessor. The Output 
Microprocessor receives the data from the center stage and 

65 makes the appropriate output connection to complete the 
path. The "j" center stages provide a space switch path 
between Input and Output Microprocessors, and switches an 
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assembled group of time-slots to its proper destination. several contiguous time-slots. Just as for the switching 
Thus, the center stages are providing "group switching" as fabric, the serial bit stream is shifted into an external register, 
defined earlier in this document, and it has been shown also then burst into the cache as bytes of information. The RISC 
in this document that there is more than a 5 times increase examines the contiguous byte sequence to see if it corre- 
in the number of time-slots switched for a "group switching" 5 sponds to the synchronization code. If not, then a single bit 
microprocessor, compared to a single time -slot switching. shift instruction is implemented and the resulting, changed 
Thus, the center stages "1" through "j" each have 5 times the contiguous bytes are examined. This procedure of examin- 
capacity as one of "k" I/O processors. The center stage needs ing the input bit stream for the correct code as subsequent 
to handle twice the number of time-slots, ("2n") as the I/O bytes are entered via the I/O, continues until the synchro- 
stages, ("n") per processors, but as shown above, has 5 times 10 nization code found. This establishes the frame synchroni- 
the capacity. The number of center stages required is j=2(n/ zation point, and puts the serial link into synchronization. An 
s)k, where n/s=l/5 is the ratio of the processing capacity of additional byte in the sequence establishes the super-frame 
a "k" stage, TSI relative to a "j w stage group switching, and boundary, which is searched for until super-frame synchro- 
"k" is the number of input or output stages rounded up to the nization is also attained. Other required functions can simi- 
next integer. For example, for "k"=5, j=2(l/5)5=2. Thus, for 15 larly be implemented by appropriate operations on the bit 
a TSI 5 times the size of a single processor TSI, a total of 5 stream. Multiple links can be supported on a single micro- 
input, 5 output and 2 center stage processors are needed for processor. Other serial telecommunications links can also be 
a total of 12 microprocessors for 5 times the capacity, or implemented including the well-known standard DS1, DS3, 
25,000 time-slots using the benchmark 200 MHz PowerPC. DSn, El, E3, and other 32 channel based facilities, SONET, 
FIG. 15 can also be used for packet switching, e.g., ATM. 20 SDH, as well as proprietary serial links such as the PCT, 
In the case of ATM, much of the real time is involved in NCT (Network Control and Timing), PDDB (Peripheral 
VPI/VCI Table look-up, and QOS queuing and priorities. Interface Data Bus), etc., used by the 5ESS® Switch. 
These functions can be shared between Input and Output One protocol can be used for transmitting data in another 
microprocessors of FIG. 15 in a functional distributed man- protocol. For example, the frame relay protocol, or the ATM 
ner. Thus, about twice the ATM switching capacity can be 25 protocol can be used for transmitting data in the IP protocol, 
provided by means of using both input and output micro- The switching system can then switch data in the carrying 
processors for different functions. The center stage pure protocol, and the carried protocol data can then be extracted 
connectivity bandwidth, however, matches the bandwidth from the switched data. 

needs of the I/O pretty closely Thus, for 5 input stages and A specific microprocessor can be used to terminate either 

5 output stages, 5 center stages are required. The capacity of 30 one type of the serial links or several types of these serial 

this network is, however, about 10 times that of a single links concurrently. The multiple microprocessor configur- 

ATM microprocessor because of the functional distribution. ations shown in FIGS. 14 and 15, can also be also be used. 

Thus, 15 microprocessors provide 10 times the capacity or The above has described a Universal Switching Fabric 

a 10 Gbps ATM switch, for the benchmark 200 Mhz and a Universal Serial Link Termination as separate entities, 

PowerPC. Proportionally, higher band widths can be 35 but they can be combined in a single microprocessor. For 

obtained with higher performance PowerPCs. example, a single microprocessor can terminate any of the 

FIG. 16 shows a three stage network which uses an ASIC above serial links described, and then provide the ATM 

or FPGA to implement the center stage as a crossbar switch, switching in the same microprocessor concurrently. If 

(circuit switched). This may be more cost effective for larger desired, universal serial link termination, (any/all of the 

configurations, at the expense of the lead time necessary to 40 above links described in this document), can be coupled with 

get the ASIC or FPGA. the universal switching fabric, (any/all of the above fabrics 

FIGS. 15 and 16 show a three-stage network for getting discussed in this document), and operate concurrently in the 

from input to output. There is an architecture where the same microprocessor. The multiple microprocessor configu- 

network shown in these figures constitute a single entity of rations shown in FIG. 14 and FIG. 15, also apply. Of 

a larger interconnected configuration, comprising a plurality 45 particular interest, is the coupling of periphery and switch- 

of entities, connecting to a global center stage for intercon- ing fabric on the same microprocessor, and then extending 

necting the entities. For such a configuration, the center a fink to the center stage of FIG. 15 to provide a distributed 

stage of FIG. 15 or FIG. 16, can be connected directly to the fabric with Integrated periphery. 

global center stage. Thus, multiple subsstantiations of FIG. The above approach can be used to provide higher level 

15 or FIG. 16, can be interconnected via this global center 50 combined switching functionality. An example is the imple- 

stage. These center stages, in conjunction with the global mentation of the functionality of an entity such as a trunk 

center stage constitute a three-stage configuration for the only, 5ESS® Switching Module on a single microprocessor, 

new center stage. The input and output microprocessors This would include the TSI, the trunk terminations, the NCT 

implement the same functionality as in FIG. 15 and FIG. 16. interface to the TMS, service circuits such as tone generation 

Just as for a single microprocessor, multiple or single 55 implemented in a chip floating point decimal unit of the 

function switching fabrics can reside, and be implemented microprocessor, or vector manipulation unit as the hardware 

concurrently on the same microprocessor. base. On the same microprocessor, the Switching Module 

Although the discussion thus far has focused on "switch- Processor, (SMP), can be concurrently implemented, (either 
ing fabrics*', the RISC microprocessor can be used to in native or emulation mode), for call processing and main- 
implement other functionality. 60 tenance software, as the software base. For an SM, 

A RISC microprocessor can also be used to terminate (Switching Module) that contains subscriber lines as well, 

serial telecommunications links. As an example, a propri- all of the above can be used to support subscriber lines 

etary PCT, (Peripheral Control and Timing) link can be implemented in the conventional way, as well as the embed- 

considered. This serial fiber optic link has a 1024 time-slots, ded trunk circuits. 

768 of which are used for data transport, and the remainder 65 The microprocessor can also be used to implement gen- 

for control, synchronization, and other functions. Frame eralized logic function cost effectively, especially if they 

synchronization is established by a fixed code established in have a strong component of sequential logic. Thus, this 
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approach can be very useful in providing the functionality such as PCM signals. In case a communication leaves this 

presently implemented by a Field Programmable Gate Array switch, the connection is made between the VT2 connected 

(FPGA), and be more cost effective and provide more rapid to an AIU, such as 1710, or 1711, and a VT2 connected to 

deployment. This approach can also be used for replacing an OIU, or between VT2s connected to OIUs. 

Application Specific Integrated Circuits (ASICs), and 5 In order to be compatible with an existing plant, such as 

replace them with either a single or multiple microprocessor, the 5 ESS® switch manufactured by Lucent Technologies 

depending on the application Inc > me fabric also 0311 communicate with links such as 

While the preferred embodiment shows sequential storage 5ES S® Network Coni ™ 1 and Timing (NCI^ hnk^ 

of input time-slots and readout based on the control memory c ™f" to a COn ^"!^ "?° ° f 3 ?J* S ® 

contents, it is also possible to use storage based on control 10 switch When narrowband NCR 1731 terminate on the core 

. , - . ... i j . fabnc, their signals are converted into VT2 signals. Broad- 

memory contents in conjunction with sequential readout, band ^ CTs 1? | 2 ^ ^ ^ ^ ^ ^ ^ 

although such an arrangement handles broadcast connec- ^ controlled b a swiihing module processor controller 

lions less efficiently The ^arrangement of FIG. 14 does not im> which via charmcls of me fabdc , both 

work satisfactorily for broadcasting m the non-preferred t0 COQtrol circuits thc fabfiC( md to COQtrol circuits ^ 

arrangement, (storage based on control memory and sequen- 15 me md qiu. 

tial readout), because for each input word that is received, p or ease of understanding, FIG. 17 is drawn as if the line 

different microprocessor may be required to do different modules 1710 and 1711 were input modules, and the trunk 

amounts of processing. modules 1720 and 1725, were output modules. Actually, the 

The approach discussed herein has the inherent flexibility central cell fabric 1700 can switch signals from any port to 
of stored program control. It can, therefore, be used to 20 any other port; thus, serving line-to-line traffic, and trunk- 
implement new and different protocols such as the Dynamic to -truck traffic, as well as Line-to-trunk traffic. Also, the 
Synchronous Transfer Mode (DTM), recently proposed by signals for any line or trunk are two-way signals. Thus, the 
the European Standards body. modules 1710, 1711, 1720, and 1725 are all "input" modules 

RISC microprocessor technology is moving at a very fast to cell fabric 1700, and cell fabric makes hairpin connections 

pace. There are microprocessors operating at higher fre- 25 between signals from any connected port to any other 

quencies just beyond the horizon which will allow for higher connected port. The fabric is a three-stage fabric: first, the 

capacities on a single chip. Moore's law will push these module carrying a signal to the cell fabric; second, the cell 

capabilities even further in the future. fabric 1700; third, the module carrying a signal from the cell 

FIGS. 17-20 illustrate a large switch whose core element fabric, 
is a switch 1700 for switching any input VT2 signal to any 30 FIG. 18 illustrates the connections between AIU units 
output VT2 signal. The switch is fed by a plurality of Access (1710 and 1711), and OIU units 1720 and 1725, and the core 
Interface Units (AIU), containing line cards for handling fabric 1700. Each connection to the central fabric is termi- 
ADSL, (Asymmetric Digital Subscriber Lines) lines, line nated on a port such as narrowband port 1809, and broad- 
cards for handling Plain Old Telephone Service (POTS) band port 1839. In the case of a narrowband port, the port 
fines, and trunk cards for handling input digital or analog 35 must recover clock from the input signal, buffer the input, 
trunks. The outputs of these cards are assembled as input generate flow bits for the unoccupied 248 bits of the 
streams which are, in turn, fed to a microprocessor based payload, perform framing, and receive information for con- 
telecommunications switching network fabric element 100, trolling control registers of the port. In the case of the 
such as that shown in FIG. 1. Modules 1711 and 1725 are broadband port 1839, it is required to buffer the inputs, 
narrowband modules, delivering and accepting narrowband 40 recover the clock, perform framing, and receive input infor- 
(DS0) signals; modules 1710 aand 1720 are broadband mation for control register. The narrowband peripherals, i.e., 
modules, delivering and accepting broadband (VT2) signals. those peripherals generating narrowband signals such as 
Module 1711 is connected only to POTS lines and/or ISDN peripherals 1830, which includes peripherals 1711 and 1725 
lines, all of which communicate at DS0 rates, or below. are connected to the central fabric 1700 via a pair of 
Module 1710 is also connectible to ADSL, (Asymmetric 45 paddlecards. The basic function of the paddlecard is to allow 
Digital Subscriber Line) lines. transmission either in the optic, or the electric domain, (and 

Each VT2 signal has a payload of 256 bits, and is possible conversions between the two), to proceed over links 
generated at a repetition rate of 8 KHz per second. In of wire or optic fiber which require interface circuitry in 
addition to the 256 payload bits, there are two signaling bits, order to recover synchronization and framing. The p addle - 
and one bit to distinguish a narrowband signal, (one with a 50 card can also provide a 1:N split of the VT2 stream so that 
payload of 8 bits), from a wide band signal, (one with a N sets of peripheral fabric can be connected to one Broad- 
pay load of some multiple, up to 32 8-bit bytes). An ISDN band Port. This is given the notation 1855 in FIG. 18. 
signal would be sent as three narrowband signals, because 1830 contains hard-coded PCM circuit stream processing 
the two B -channels may go to different destinations, and the equipment, and no ability to do any kind of protocol 
D-cbannel is connected to a packet switching unit. In this 55 processing or switching. This is in direct contrast to 1860 
preferred embodiment, all narrowband signals are sent in the which actually interprets the bit stream, and each type of 
form described above; there is no bundling of narrowband stream can be its own protocol, and can be packet, or circuit 
signals into one VT2 signal. The reason is that the interface switched, per stream. 

ports of the core fabric 1700 can only process signaling For broadband and narrowband peripherals that are 

information for a single logical communication per physical 60 housed, for example, in AIU or OIU, the peripheral 1870 

VT2 channel. feeds a fabric contained in an AIU such as 1710, or OIU such 

The core fabric is also connected to broadband output as 1720, which feeds a broadband paddlecard 1850, for 

interface unit (OIU) 1720, and narrowband OIU 1725, transmission to a broadband paddlecard 1840 that interfaces 

which interfaces with broadband transmission facilities, with the central fabric 1700. The broadband paddlecard 

such as SONET/SDH. The SONET/SDH facility can, in 65 feeds a port 1839. 

turn, carry virtual trunks with packet switched signals such The 768 VT2s are split into 6 streams of 128 VT2s each, 

as ATM signals and IP signals, or circuit switched signals The main reason for this, is that 128 VT2s represents 250+ 
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Mb/s of reading and writing from the stream, and 250+ Mb/s 
of reading and writing to the peripherals, for a total aggre- 
gate of reading and writing and protocol processing of 1 
Gbps. This is a large amount of processing for 1999 multi- 
protocol, heterogenous processing. From a practical basis, 
this is a good size for being able to process all the different 
protocols for a single processor complex. This also is the 
most cost-effective modular unit from a scalability point of 
view. So, the central fabric's "N" ports actually drive "6*N" 
peripheral fabrics. 
Central Fabric 

The central fabric 1700 switches elements of VT2 size. 
VT2 (Virtual Tributary 2), is a SONET/SDH standard signal, 
having a payload with thirty-two time-slots per 8 KHz 
frame, or 256 bits per frame, 2,048,000 bits per second. Each 
VT2 is bi-directional with 2.048 Mb/s in each direction. 

The central fabric has no knowledge of what type of 
service, or bit stream is in the VT2 cells. No interpretation 
of the bit stream is performed. From each port at each VT2 
timeslot, the contents of the VT2 cell are copied to the 
appropriate outgoing port and correct timeslot, (TSI 
function). 

The central fabric has the following types of ports: 
N_in=Narrowband Input Port 
N__out-Narrowband Output Port 
B_jn=Broadband Input Port 
B_„out=Broadband Output Port 
AND the accompanying data paths: 
N_Jn_N_out=Narrowband to Narrowband Path 
N_Jn_B_out=Narrowband to Broadband Path 

B in_N__out=Broadband to Narrowband Path 

B_in_B_out=Broadband to Broadband Path 
Given these definitions, the central fabric performs a 
simple copy according to the following rule: 

Always copy a 32 byte, (256 bit), data quantity+signaling 
bits from input port to output port, regardless of how 
the ports are denned in the control RAM, or which of 
the above paths are defined. No concept of port types 
exits in the TSI itself. 
The essence of supporting both types of ports is in the 
interface to Data RAM protocol. The N_in port will always 
copy, the signaling bits and the data into the signaling bit 
storage, and the first 8 bits of the 256 bit data RAM. The 
remaining 248 bits are loaded with a "fill" pattern from 
internal resources. The B_Jn port will always copy the 
signaling bits and the data into the signaling bit storage, and 
all 256 bits of data. The N_out port will always read only 
the signaling bits, and the first 8 bits of RAM of the timeslot, 
out onto the link. 

The B_out port is special. Here a special 259* bit (an 
extra signaling bit), is written based on the control RAM 
storage. This bit tells whether the particular timeslot is a 
narrowband, or broadband service. In either case, all sig- 
naling and 256 bits are read onto the link. The paddlecard 
that connects the outgoing data to the peripherals may use 
the 259* bit. If it is a broadband peripheral, the bit is ignored 
and all the bits are written on the optical link, and the issue 
of narrowband, or broadband service, is deferred to the 
peripheral fabric. If it is a broadband network interface, it 
actually has broadband and narrowband inter-SM connec- 
tions on the same paddlecard. Each logical port has a 
narrowband timeslot, and a broadband timeslot The nar- 
rowband timeslot is written if the 259* bit indicates this is 
a narrowband service. 

As shown in FIG. 19, the narrowband service is then 
routed to a communication module, such as the CM3 of the 
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5 ESS® switch manufactured by Lucent Technologies Inc., 
as it would be today, on an all-narrowband switch. The 
broadband timeslot to the Switch Center Stage Broadband 
Fabric is written with "fill" information. If the 259* bit 

5 indicates this is a broadband service, then the broadband 
timeslot is written with the broadband service. This link is 
connected to the Switch Center Stage Broadband Fabric. 
The narrowband timeslot to the CM3 is written with the 
appropriate "fill" code. The 259* bit is written by the 
broadband fabric connect order, to a "broadband service 
code". If the register is not written at the time of the connect 
order, the hardware default loads "narrowband service code" 
for the 259* bit. 

Broadband outputs of the Central Fabric 1700 for traffic 
between Central Fabrics are connected via paddlecards 1930 

15 and 1940, to a Parallel VT2-TSI Interconnect Fabric 1910. 
The parallel VT2-TSI (1910) takes advantage of the multi- 
plicity of ports that exit on the center stage (1700). There are 
"N" ports available on the center stage (1700), and typically, 
a balanced center stage has K=N/2 ports attached to 

20 peripherals, and K ports attached to the 2 nd stage central 
fabric (1910). The basic process of constructing a parallel 
TSI second stage, is to assign some of the K ports to each 
of "S" TSIs in the second stage. Multiple first level stages 
are interconnected by assigning some port(s) to each first 

25 level stage in each second level stage. The largest, economi- 
cal scaling possible, is to cross-connect 2*K=N first level 
stages. In this configuration, each first level stage has 
connected all K network ports to K second level fabrics, 
with one port per second level fabric. Since the number of 
ports is actually N per center stage, N different first level 
stages can be interconnected. The total amount of peripheral 
ports, (also K=»N/2), that can be interconnected by a multi- 
plicity of N port first level stages, connected to K second 
level stages, and the first and second level stages are using 

35 identical parts, (this is the economical part), is K*N«N*N/2, 
In practical implementations, in 1999 technology, this 
results in a total interconnection capability of terabits per 
second. 

Peripheral Fabric 
^ The peripheral fabric exists to support broadband 
services, but also supports narrowband services. The units, 
(especially those supporting ADSL), will be simultaneously 
supporting narrowband and broadband services. 
The peripheral fabric can handle any service on the VT2 
45 cell payload. Services that are narrowband will be treated as 
if only the first byte of the VT2*s 32 bytes of data are needed. 
Services that are broadband will use from 2 to 32 bytes of 
the VT2. This is defined on a per-service basis. 
The peripheral fabric supports the following main types of 
5Q broadband bandwidth classes: 
VT2 dedicated 
NxVT2 dedicated 
VT2 shared (packet pipe) 
NxVT2 shared (bigger packet pipe) 
55 The peripheral fabric is created from processors and 
memory with actual software determining how the fabric is 
interpreted, and how the payloads in the fabric are inter- 
preted. The fabric become "universal" because any software 
program can run any protocol on a per stream basis. This 
60 heterogeneous processing capability is supported in both 
best-effort, and zero-jitter, circuit quality varieties. 

The peripheral fabric connects on one side to the central 
VT2 TSI fabric 1700, and on the other side with both 
narrowband and broadband peripherals. The peripheral fab- 
65 ric performs the appropriate bi-directional protocols on a per 
stream basis so that each customer flow is in the correct 
format, and going to the correct location. 
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FIG. 20 is an expansion of the VT2 Cell Fabric (1700). 
The entire fabric consists of "P" ports. Each of the P ports 
can be set up to be narrowband or broadband. Each port 
supports an input and an output flow (bi-directional). Every 
payload can be routed from any incoming port "A" to any 
outgoing port "B". This includes the special case of loop- 
around, where A is equal to B. 

Input Port A (2001) contains the Input Port Signal and 
Timing Recovery Unit (2003). This is programmed from 
information in the Control State Storage (2000) as to 
whether the port is conducting narrowband or broadband 
operations. 

If the port is conducting narrowband operations, the Input 
Port Signal and Timing Recovery Unit will extract narrow- 
band payloads from the incoming signaL For each narrow- 
band payload, the following will occur: the payload will be 
copied into the appro -priate TSI input storage location 
(2013). In particular, the two bits of signal payload will be 
copied to the Signal Bit Storage (2010) via path (2005), and 
the 8 bits of narrowband payload will be copied to the 
storage (2011), via path (2006). In addition, no information 
exists to travel on path (2007) to the selector (2004), because 
it is narrowband. Instead, the selector (2004), takes "fill" 
from the Input Fill Bit Unit (2002), via path (2008), and puts 
it onto path (2009), for conveyance to storage (2012). 2010, 
2011, and 2012, together constitute 2013. 

If the port is conducting broadband operations, the Input 
Port Signal and Timing Recovery Unit will extract broad- 
band payloads from the incoming signal. For each broad- 
band payload, the following will occur: The payload will be 
copied into the appropriate TSI input storage location 

(2013) . In particular, the two bits of signal payload will be 
copied to the Signal Bit Storage (2010), via path (2005), and 
the first 8 bits of broadband payload will be copied to the 
storage (2011), via path (2006). The remaining 248 bits of 
broadband pay-load travel on path (2007) to the selector 
(2040), and then onward to storage (2012), via path (2009). 
The incoming "259'* bit" is absorbed in the Input Port Signal 
and Timing Recovery Unit (2003). 

The data and signaling bits are transferred form input 
storage (2013), to output storage (2019), via internal busses 

(2014) . The TSI output storage (2019) consists of Bit 259 

(2015) , Signal bit storage (2016), and data storage (2017 and 
2018). The information in (2010) is sent to (2016), via path 

(2020) . The information in (2011) is sent to (2017), via path 

(2021) . The information in (2012) is sent to (2018), via path 

(2022) . In addition, Bit 259 (2015), is loaded from control 
stage storage information (2000). Bit 259 contains the 
"broadband or narrowband service indicator". 

The last step for generating the output port signal is to 
apply the specific information from the appropriate TSI 
output storage (2019), to Output Port B (2023), and 
specifically, the output port generator (2024). 

If the output port B (2023) is programmed as a narrow- 
band port, the output port generator (2024) will use the 
information from signal storage (2016), and data storage 
(2017), via paths (2026) and (2027), respectively. The signal 
will then be sent out on path (2029). 

If the output port B (2023) is programmed as a broadband 
port, the output port generator (2024) will send out all the 
information on path (2029), received from the 259* bit 
(2015), signal storage (2016), and data storage (2017 and 
2018), by means of paths (2025, 2026, 2027, and 2028), 
respectively. 

FIG. 20 shows only a single input time-slot and a single 
output time-slot of a signal memory for one port. The rest of 
the time-slots are not shown. In one preferred 
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implementation, the memory for the multiple time-slots is 
housed within a single integrated circuit chip. The output of 
selected time -slot of the memory is routed over the bus 
system 2014 to a time-slot of the memory of a selected 

5 output port The sequential time-slots of the memory are 
loaded sequentially, and selectively transmitted to a selected 
time-slot of a selected output port, or are loaded selectively, 
and sequentially transmitted to a selected time-slot memory 
of the selected output port. The selection process for each 

10 port is independent of the selection process of other ports, 
and is controlled by separate memory in the control stage 
storage. 

If only a single input time-slot and output time -slot is used 
for each port, then Block 1700 becomes a Time Multiplexed 
15 Switch. 

Time Multiplexed Switch (TMS) Core Fabric 

In a variation on the architecture of FIG. 17, the Time-Slot 
Interchange Fabric (TSI), cell fabric (1700), is replaced by 
an nx64 Kbps Time Multiplexed Switch, (TMS). Tne TMS 

20 can allocate groups of 64 Kbps time-slot, (e.g., 128 Kbps, 
384 Kbps, 1.5 Mbps, 2.0 Mbps, etc.), on any specific link. 
This implementation provides more granularity than the 
VT2 TSI (1700), i.e., less bandwidth is used for connections 
which are lower than 2.0 Mbps. Since the TMS is a space 

25 switch, the TSI functionality must be provided somewhere 
else. For the B-AIU (1710), and BB-OIU (1720), the periph- 
eral fabric, (100), software is added to the microprocessor to 
provide the TSI function. The peripheral fabric has the real 
time capacity to be a 6000 time-slot TSI for a 300 MHz 

30 Power PC. The Peripheral Fabric 100 TSI voice call 
requirement, as shown in FIG. 17, is 128 voice calls, (128 
maximum), of the thousand of time-slot capability of the 
peripheral fabric, (100). Thus, a minimum of 97% of the 
peripheral fabric real time is still available for ATM, IP, and 

35 other switching methods, and real time activity. 

The AIU (1711), and OIU (1725), however, do not contain 
a peripheral fabric, and so the TSI functionality must be 
placed somewhere else. The microprocessor based TSI for 
the narrowband AIU and OIU, can be co-located with the 

40 TMS, and switch the 128x DSOs from multiple AIUs and 
OIUs. Multiple TSIs can be used, up to the capacity of the 
TMS. This architecture allows for a lower bandwidth posi- 
tion switch to be used as Peripheral Fabric 100, and would 
be appropriate and cost-effective, for example, for initial 

45 market penetration, where the percentage of customers 
using, or needing the full 2 Mbps, would be small, or 
modest. The same board could be used for the peripheral 
fabric, (100), and the TSI used for the narrowband TSI, with 
only a software difference. 

50 Network Connections 

The following half call scenarios, illustrates connections 
in the network. 

Half Call Scenario 1: Narrowband Service on Narrowband 
Port 

55 This is the simplest case. This is support for the current 
legacy narrowband interconnection between narrowband 
peripherals, and narrowband fabric. Remember that the 
broadband fabric can not tell that the service is narrow, and 
will automatically copy 32 bytes per frame. As was 

60 described earlier, the N_Jn and the N_out ports 1809, 
provide special capabilities to add in "fill", and subtract out 
"fill". The end result is that narrowband to narrowband 
connections are possible. 

There are two flows in a narrowband service. 

65 The first flow is in the direction of peripheral to central 
fabric. The second flow is in the direction of central fabric 
to peripheral. The first flow travels from the peripheral 1830 
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to the narrowband paddlecard 1820, for transmission to the 
central fabric 1700. The reader will note that it is common 
to provide one or more stages of concentration to reduce the 
number of narrowband ports that are needed, prior to paddle- 
card transmission. The narrowband link transmits a number 
of DSO timeslots, with each timeslot also being appended 
with internal-specific signaling bits for reliability and other 
purposes. The narrowband link is terminated on another 
narrowband paddlecard 1810, at the central fabric. The next 

step is to send the bit stream to any N in central fabric port 

1809. At this point, the "half call" can be switched in the 
Central Fabric, as previously described. 

The second flow is in the direction from the central fabric 
to narrowband peripherals. The bit stream will have been 
copied in the central fabric to the N_out port 1809 buffers, 
as previously described in the Central Fabric Section. From 
there, the bit stream is sent to the narrowband paddlecard for 
transmission to the peripherals' location. The stream will be 
terminated by another narrowband paddlecard 1820. Note 
that it is common to then place one or more fanout, (opposite 
of concentration), stages to expand a densely occupied 
timeslot stream into multiple sparsely occupied streams, 
each flowing to a particular multiplicity of peripherals on a 
specific sub-unit. The flow then continues to the actual 
peripherals 1830, themselves. 

Half Call scenario 2: Narrowband Service on Broadband 
Port 

There are two flows: 

The first flow is from narrowband peripheral 1870 to 
broadband port (B_in). The narrowband stream flows to the 
peripheral fabric which can now perform the classic con- 
centration function, without hardware stages. The peripheral 
fabric 1860, places the narrowband flows into their respec- 
tive VT2s. In this case, only 1 of the 32 data bytes contains 
real information, the remaining 31 bytes contains "fill" for 
each stream. The peripheral fabric emits the flow to the 
broadband paddlecard 1850, which transmits the flow to the 
central fabric location where it is terminated on another 
broadband paddlecard 1840. This paddlecard then transmits 
the flow to a broadband input port, (B_in) 1839. As dis- 
cussed earlier, in the Central Fabric Section, the entire VT2, 
(1 real, and 31 fill bytes), will be switched in the VT2 TSI. 
However, whether the other peripheral is broadband or 
narrowband does not matter. In either case, the correct byte 
will arrive eventually in the correct location of the VT2, or 
occupy an entire DSO. 

The second flow is from broadband port (B_out) 1839, to 
narrowband peripheral. This is the more interesting direc- 
tion. Recall that the B_out port has the special property that 
the paddlecards may, or may not, pay attention to the 259'* 
bit of each VT2 frame. In this case, the 259** bit will have 
been set to narrowband in case this was an inter-USM, (or 
SM) port. In that case, the traffic would have been routed to 
the narrowband CM3. In this case, however, the bit is 
ignored since the decision as to how to handle each timeslot 
is deferred to the peripheral fabric 1860. So, the flow 
continues to the broadband paddlecard 1840, travels over the 
optical link to the peripherals* location, and is terminated on 
the peripheral fabric's paddlecard 1840. This paddlecard 
sends the flow to the peripheral fabric 1860, which then 
forwards the first byte of the VT2, and the appropriate 
signaling bits, to the narrowband peripherals 1870. Recall 
that the peripheral fabric can also perform a fanout function. 
Half Call Scenario 3: Broadband Service on Broadband Port 

There are two flows: 

The first flow is from the broadband peripheral 1870, to 
central fabric broadband port, (B_in) 1839. Each broadband 
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stream is transmitted to the peripheral fabric 1860. The 
peripheral fabric understands via provisioned, or dynamic 
connection order, what type of protocol is in the stream. The 
peripheral fabric performs the appropriate segmentation 

5 function, and puts the broadband stream into VT2 cells for 
transmission through the fabric. The flow continues through 
the paddlecards 1850 and 1840, all the way to the B_in port 
1839. As previously described, the broadband flow is 
switched in the central fabric. 

10 The second flow is from the central fabric broadband port 
(B_out) 1839, to the broadband peripheral 1870. The 
B_out stream is more interesting. Remember the 25 9 th bit 
here, will be set to "broadband for this stream. This is a 
peripheral paddlecard, so the 259^ bit is ignored, but also as 

15 previously described, if this was a network port, it would be 
appropriately routed. The flow goes out to the peripheral 
fabric 1860 via paddlecards 1840 and 1850, and upon 
arrival, is processed by the peripheral fabric. The peripheral 
fabric performs the appropriate re-assembly function from 

20 VT2 cells to output stream protocol format. The stream is 
then transmitted to the correct broadband peripheral 1870. 
Half Call Scenario 4: Narrowband Service (Network-Side) 
on Broadband Port 
This call scenario concerns internal (network) connec- 

25 tions. These connections exist to allow the interconnection 
of a multiplicity of central fabrics. So, this represents a 
"internal half-call" that is not seen by the customer, but does 
actually exist, to carry out the interconnection of two cus- 
tomers in cases where one customer exists on one central 

30 fabric, and the other customer exists on a different central 
fabric. In these cases, a second level of central fabric, 
interconnects the first level of central fabrics. 
There are two flows. 

The first flow is 1731 from the narrowband 

35 interconnection, (second level) fabric 1730, to the input port, 
(B_in), of the first level central fabric. The first flow enters 
the paddlecard 1930 at port, (N2_jn), a narrowband port. 
The paddlecard has two choices per timeslot. It can choose 
to use the information either N2 _Jn, or B2 _Jn. In this case, 

40 it will use N2__in, and copy the information signaling +8 bits 
of data to the stream heading to B _Jn. The paddlecard 1930 
also has to append the remaining 31 bytes of "fill" to the 
timeslot, and insert the 259* bit to say narrowband service, 
(3"* signaling bit). Once this is done, the stream continues to 

45 the B^in port, where it can then be copied to any other port 
as previously described. 

One question remains in the above explanation. How did 
the paddlecard know that it needed to use N2 _Jn*s timeslot, 
rather than B2__uTs, timeslot? The answer is that the stream 

50 leaving for N2_out, contained a timeslot signal bit, (259*), 
set to "narrowband". This bit refreshed a storage register in 
the paddlecard ASIC, that is used by the "IN" stream state 
machine as a decision function. So, the outgoing stream's 
information contains the information for the corresponding 

55 input stream as well. This allows the TSI to manipulate the 
paddlecards on a per timeslot basis. Again, as a refresher, 
this allows the interworking with the legacy aspects of the 
system. The narrowband traffic does not know about the 
existence of broadband services. This is a great advantage 

60 because no rework for the existing applications is necessary, 
and universal fabric SMs can work with pre-universal fabric 
SMs by interworking data directly through the second level 
fabric without the applications knowing that anything is 
happening. 

65 TTie second flow is from the output port, (B_out), of the 
first level central fabric 1702, to the narrowband 
interconnection, (second level), fabric 1730. The flow leaves 
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the B_out port with the 259 th bit set to narrowband. Upon 
arrival at the paddlecard 1930, the bit orders the ASIC to 
route it to the N2_out port The first byte of data, and the 
signaling bits are placed in the N2_out stream. The remain- 
ing 31 bytes are ignored from N2_out's perspective. 5 
Simultaneously, the B2_out port is loaded with 32 bytes of 
fill data, since no valid information is present 
Half Call Scenario 5: Broadband Service (Network-Side) on 
Broadband Port 

This call scenario concerns internal, (network), connec- 10 
tions. These connections exist to allow the interconnection 
of a multiplicity of central fabrics. So, this represents a 
"internal half -call" that is not seen by the customer, but does 
actually exist, to carry out the interconnection of two cus- 
tomers in cases where one customer exists on one central is 
fabric, and the other customer exists on a different central 
fabric. In these cases, a second level of central fabric 
interconnects the first level of central fabrics. 

There are two flows. 

The first flow 1732 is from the broadband 20 
interconnection, (second level) fabric 1910, to the input port, 
(B_in), of the first level central fabric 1702. The first flow 
enters the paddlecard 1930 at port, (B2_Jn), a broadband 
port. The paddlecard has two choices per timeslot. It can 
choose to use the information from either N2_in, or B2_jn. 25 
In this case, it will use B2_Jn, and copy the signaling+256 
bits of data to the stream heading to B_in. The paddlecard 
also has to insert the 259 th bit to say broadband service, (3 rd 
signaling bit). Once this is done, the stream continues to the 
B_in port where it can then be copied to any other port as 30 
previously described. 

One question remains in the above explanation. How did 
the paddlecard 1930 know that it needed to use B2_in's 
timeslot, rather than N2 _ in's, timeslot? The answer is that 
the stream leaving for B2-out contained a per timeslot signal 35 
bit, (259 th ), set to "broadband". This bit refreshed a storage 
register in the paddlecard ASIC that is used by the "IN" 
stream state machine as a decision function. So, the outgoing 
stream's information contains the information for the cor- 
responding input stream as well. This allows the TSI to 40 
manipulate the paddlecards on a per timeslot basis. Again, as 
a refresher, this allows the interworking with the legacy 
aspects of the system. The narrowband traffic does not know 
about the existence of broadband services. This is a great 
advantage, because no rework for the existing applications 45 
is necessary, and universal fabric SMs can work with 
pre-universal fabric SMs by interworking data directly 
through the second level fabric, without the applications 
knowing that anything is happening. 

Hie second flow is from the output port, (B_out), of the 50 
first level central fabric 1702, to the broadband 
interconnection, (second level), fabric 1910. The flow leaves 
the B_out port with the 259 fA bit set to broadband. Upon 
arrival at the paddlecard, the bit orders the ASIC to route it 
to the B2__out port. All 32 bytes of data, and the signaling 55 
bits are placed in the B2__out stream. Simultaneously, the 
N2_out port is loaded with 1 byte of fill data, since no valid 
information is present. 

Half Call Scenario 6: Narrowband Service (Network-Side) 
on Narrowband Port 60 

This call scenario concerns internal, (network), connec- 
tions. These connections exist to allow the interconnection 
of a .multiplicity of central fabrics. So, this represents a 
"internal half -call" that is not seen by the customer, but does 
actually exist, to carry out the interconnection of two cus- 65 
tomers in cases where one customer exists on one central 
fabric, and the other customer exists on a different central 
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fabric. In these cases, a second level of central fabric 
interconnects the first level of central fabrics. 
There are two flows. 

The first flow 1731, is from the narrowband 
interconnection, (second level) fabric (CM3) 1730, to the 
input port, (N__in), of the first level central fabric 1701. The 
first flow gets to the first level fabric via transit through the 
narrowband paddlecard, (NCT2/3 termination). The N_in 
port performs the fill bit operations, etc., as previously 
described, so that the narrowband payload and signaling are 
mapped into the VT2, and signaling storage. Once this is 
done, the stream can then be copied to any other port, as 
previously described. 

The second flow 1731, is from the output port, (N_out), 
of the first level central fabric 1701, to the narrowband 
interconnection, (second level), fabric. The N_out port 
performs the appropriate extraction of the DS0 payload and 
signaling bits from the internal VT2+sigoaling format of the 
TSI. The flow then leaves the N_out port of the fabric, and 
proceeds to the second level central fabric via the narrow- 
band paddlecard, (NCT2/3). 

The paddlecard can also provide a 1: N split of the VT2 
stream so that N sets of peripheral fabric can be connected 
to one Broadband Port. This is given the notation 1855 in 
FIG. 18. 1830 contains hard-coded PCM circuit stream 
processing equipment, and no ability to do any kind of 
protocol processing, or switching. This is in direct contrast 
to 1860 which actually interprets the bit stream, and each 
type of stream can be its own protocol and can be packet, or 
circuit switched, per stream. 

The 768 VT2s are split into 6 streams of 128 VT2s each. 
The main reason for this is, that 128 VT2s represents 250+ 
Mb/s of reading and writing from the stream, and 250+ Mb/s 
of reading and writing to' the peripherals, for a total aggre- 
gate of reading and writing and protocol processing, of 1 
Gbps. This is a large amount of processing for 1999 multi- 
protocol, heterogenous processing. From a practical basis, 
this is a good size for being able to process all the different 
protocols for a single processor complex. This also is the 
most cost effective modular unit from a scalability point of 
view. So, the central fabric's "N" ports actually drive "6*N" 
peripheral fabrics. 

CONCLUSION 

The above description is of one of preferred embodiments 
of Applicants' invention. Many other embodiments will be 
apparent to those of ordinary skill in the art without depart- 
ing from the scope of Applicants* invention. The scope of 
the invention is limited only by the attached Claims. 

What is claimed is: 

1. A telecommunications switching network, comprising: 

a plurality of modules for accepting input signals having 
different protocols, and for generating output signals 
encapsulated in a specified protocol; 

a central stage switch for switching signals in said speci- 
fied protocol from input ports of said central stage 
switch to output ports of said central stage switch; and 

said plurality of modules for converting signals of said 
output ports in said specified protocol to any of said 
plurality of protocols of said input signals; 
characterized in that: 

said central state switch comprises apparatus for switch- 
ing only broadband signals, and comprises apparatus 
for converting narrowband signals transmitted to said 
central stage switch into broadband signals, one broad- 
band signal for each narrowband signal; 
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network further comprising: apparatus for adding an 
indicator to all signals converted from narrowband 
signals into broadband signals at said central stage 
switch to differentiate between broadband signals and 
narrowband signals. 5 

2. The network of claim 1, wherein said indicator is an 
extra bit. 

3. The network of claim 1, wherein said central stage 
switch comprises apparatus for adding fill bits to received 
narrowband signals. 10 

4. The network of claim 1, wherein said central stage is a 
time -slot interchange switch. 

5. The network of claim 1, wherein said central stage is a 
time multi-plexed switch. 

6. The network of claim 1, further comprising: 15 
at least one additional one of said telecommunication 

switching networks; 
wherein, the central stage switch of each said telecom- 
munication switching network is connected to at least 
one separate communication module for interconnec- 20 
tion with other central stage switches. 

7. The network of claim 6, wherein one of said separate 
communication modules is a time multiplexed switch. 



8. The network of claim 6, wherein one of said separate 
communication modules is a time-slot interchange switch. 

9. The network of claim 6, wherein said input signals 
comprise narrowband and broadband signals, and wherein 
one of said separate communication modules is for switch- 
ing broadband signals. 

10. The network of claim 1, wherein said input signals 
comprise multiplexed signals from analog POTS, (Plain Old 
Telephone Service) lines. 

11. The network of claim 1, wherein said input signals 
comprise ATM, Asynchronous Transfer Mode, signals. 

12. The network of claim 1, wherein said input signals 
comprise IP, Internet Protocol, signals. 

13. The network of claim 1, wherein said input signals 
comprise frame relay signals. 

14. The network of claim 1, wherein said input signals 
comprise pulse code modulation, PCM, signals. 

15. The network of claim 1, wherein ones of said input 
modules receive inputs from analog narrowband telephone 
lines and Asymmetrical Digital Subscriber, ADSL, lines. 
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